Search for: All records

Creators/Authors contains: "Mitchell, Cassie S"


Comprehensive Anatomical Staging Predicts Clinical Progression in Mild Cognitive Impairment: A Data-Driven Approach

https://doi.org/10.3390/ijms26125514

Tandon, Raghav; Mei, Yajun; Lah, James J; Mitchell, Cassie S (June 2025, International Journal of Molecular Sciences)

Alzheimer’s disease (AD) presents significant challenges in clinical practice due to its heterogeneous manifestation and variable progression rates. This work develops a comprehensive anatomical staging framework to predict progression from mild cognitive impairment (MCI) to AD. Using the ADNI database, the scalable Subtype and Stage Inference (s-SuStaIn) model was applied to 118 neuroanatomical features from cognitively normal (n = 504) and AD (n = 346) participants. The framework was validated on 808 MCI participants through associations with clinical progression, CSF and FDG-PET biomarkers, and neuropsychiatric measures, while adjusting for common confounders (age, gender, education, and APOE ε4 alleles). The framework demonstrated superior prognostic accuracy compared to traditional risk assessment (C-index = 0.73 vs. 0.62). Four distinct disease subtypes showed differential progression rates, biomarker profiles (FDG-PET and CSF Aβ42), and cognitive trajectories: Subtype 1, subcortical-first pattern; Subtype 2, executive–cortical pattern; Subtype 3, disconnection pattern; and Subtype 4, frontal–executive pattern. Stage-dependent changes revealed systematic deterioration across diverse cognitive domains, particularly in learning acquisition, visuospatial processing, and functional abilities. This data-driven approach captures clinically meaningful disease heterogeneity and improves prognostication in MCI, potentially enabling more personalized therapeutic strategies and clinical trial design.
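For readers unfamiliar with the C-index quoted above, the following is a minimal sketch of Harrell's concordance index on invented toy data. It is not the authors' code, and the function name `concordance_index` and all numbers are assumptions made for illustration. The index is the fraction of comparable subject pairs in which the subject with the higher predicted risk progresses earlier; 0.5 corresponds to chance and 1.0 to perfect ranking.

```python
from itertools import combinations


def concordance_index(times, events, risks):
    """Harrell's C: share of comparable pairs ranked correctly by predicted risk."""
    concordant = 0.0
    comparable = 0
    for i, j in combinations(range(len(times)), 2):
        # Order the pair so that `a` has the shorter follow-up time.
        a, b = (i, j) if times[i] < times[j] else (j, i)
        # Skip tied times and pairs whose earlier time is censored (no observed event).
        if times[a] == times[b] or not events[a]:
            continue
        comparable += 1
        if risks[a] > risks[b]:
            concordant += 1.0
        elif risks[a] == risks[b]:
            concordant += 0.5  # ties in predicted risk count as half
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable


# Invented toy cohort: months to progression, event flag (1 = progressed), risk score.
toy_times = [2, 4, 6, 8]
toy_events = [1, 1, 0, 1]
toy_risks = [0.9, 0.4, 0.5, 0.1]
print(concordance_index(toy_times, toy_events, toy_risks))
```

In the toy cohort, five pairs are comparable and four of them are ranked correctly, so the index is 4/5.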
Free, publicly-accessible full text available June 1, 2026
Network Diffusion-Constrained Variational Generative Models for Investigating the Molecular Dynamics of Brain Connectomes Under Neurodegeneration

https://doi.org/10.3390/ijms26031062

Xie, Jiajia; Tandon, Raghav; Mitchell, Cassie S (February 2025, International Journal of Molecular Sciences)

Alzheimer’s disease (AD) is a complex and progressive neurodegenerative condition with significant societal impact. Understanding the temporal dynamics of its pathology is essential for advancing therapeutic interventions. Empirical and anatomical evidence indicates that network decoupling occurs as a result of gray matter atrophy. However, the scarcity of longitudinal clinical data presents challenges for computer-based simulations. To address this, a first-principles-based, physics-constrained Bayesian framework is proposed to model time-dependent connectome dynamics during neurodegeneration. This temporal diffusion network framework segments pathological progression into discrete time windows and optimizes connectome distributions for biomarker Bayesian regression, conceptualized as a learning problem. The framework employs a variational autoencoder-like architecture with computational enhancements to stabilize and improve training efficiency. Experimental evaluations demonstrate that the proposed temporal meta-models outperform traditional static diffusion models. The models were evaluated using both synthetic and real-world MRI and PET clinical datasets that measure amyloid beta, tau, and glucose metabolism. The framework successfully distinguishes normative aging from AD pathology. Findings provide novel support for the “decoupling” hypothesis and reveal eigenvalue-based evidence of pathological destabilization in AD. Future optimization of the model, integrated with real-world clinical data, is expected to improve applications in personalized medicine for AD and other neurodegenerative diseases.
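As background for the "static diffusion models" mentioned above, here is a minimal sketch of a generic network diffusion process, dx/dt = -beta * L * x, where L is the graph Laplacian of a connectome. This is an assumption about the general model family, not the paper's temporal framework or its code; the three-node graph, `beta`, and the step size are invented for illustration.

```python
def laplacian(adj):
    """Graph Laplacian L = D - A for a symmetric adjacency matrix with zero diagonal."""
    n = len(adj)
    return [
        [(sum(adj[i]) if i == j else 0.0) - adj[i][j] for j in range(n)]
        for i in range(n)
    ]


def diffuse(adj, x0, beta, dt, steps):
    """Explicit Euler integration of dx/dt = -beta * L * x."""
    lap = laplacian(adj)
    x = list(x0)
    n = len(x)
    for _ in range(steps):
        flow = [sum(lap[i][j] * x[j] for j in range(n)) for i in range(n)]
        x = [xi - beta * dt * fi for xi, fi in zip(x, flow)]
    return x


# Hypothetical 3-region path graph; all pathology starts in region 0.
toy_adj = [
    [0, 1, 0],
    [1, 0, 1],
    [0, 1, 0],
]
print(diffuse(toy_adj, [1.0, 0.0, 0.0], beta=1.0, dt=0.01, steps=2000))
```

Because each column of the Laplacian sums to zero, the total amount of the diffusing quantity is conserved, and on a connected graph it spreads toward a uniform distribution over time.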
Free, publicly-accessible full text available February 1, 2026
BioEL: A Comprehensive Python Package for Biomedical Entity Linking

https://doi.org/10.18653/v1/2025.findings-naacl.93

Bathala, Prasanth; Ye, Christophe; Nursal, Batuhan; Lohiya, Shubham; Kartchner, David; Mitchell, Cassie S (January 2025, Association for Computational Linguistics)

Full Text Available
Artificial Intelligence-Assisted Comparative Analysis of the Overlapping Molecular Pathophysiology of Alzheimer’s Disease, Amyotrophic Lateral Sclerosis, and Frontotemporal Dementia

https://doi.org/10.3390/ijms252413450

Wei, Zihan; Iyer, Meghna R; Zhao, Benjamin; Deng, Jennifer; Mitchell, Cassie S (December 2024, International Journal of Molecular Sciences)

The overlapping molecular pathophysiology of Alzheimer’s Disease (AD), Amyotrophic Lateral Sclerosis (ALS), and Frontotemporal Dementia (FTD) was analyzed using relationships from a knowledge graph of 33+ million biomedical journal articles. The unsupervised learning rank aggregation algorithm from SemNet 2.0 compared the most important amino acid, peptide, and protein (AAPP) nodes connected to AD, ALS, or FTD. FTD shared 99.9% of its nodes with ALS and AD; AD shared 64.2% of its nodes with FTD and ALS; and ALS shared 68.3% of its nodes with AD and FTD. The results were validated and mapped to functional biological processes using human supervision and an external large language model. The overall percentages of mapped intersecting biological processes were as follows: inflammation and immune response, 19%; synapse and neurotransmission, 19%; cell cycle, 15%; protein aggregation, 12%; membrane regulation, 11%; stress response and regulation, 9%; and gene regulation, 4%. Once normalized for node count, biological mappings for cell cycle regulation and stress response were more prominent in the intersection of AD and FTD. Protein aggregation, gene regulation, and energetics were more prominent in the intersection of ALS and FTD. Synapse and neurotransmission, membrane regulation, and inflammation and immune response were greater at the intersection of AD and ALS. Given the extensive molecular pathophysiology overlap, small differences in regulation, genetic, or environmental factors likely shape the underlying expressed disease phenotype. The results help prioritize testable hypotheses for future clinical or experimental research.
Full Text Available
Restoring Homeostasis: Treating Amyotrophic Lateral Sclerosis by Resolving Dynamic Regulatory Instability

https://doi.org/10.3390/ijms26030872

Lee, Albert J B; Bi, Sarah; Ridgeway, Eleanor; Al-Hussaini, Irfan; Deshpande, Sakshi; Krueger, Adam; Khatri, Ahad; Tsui, Dennis; Deng, Jennifer; Mitchell, Cassie S (February 2025, International Journal of Molecular Sciences)

Amyotrophic lateral sclerosis (ALS) has an interactive, multifactorial etiology that makes treatment success elusive. This study evaluates how regulatory dynamics impact disease progression and treatment. Computational models of wild-type (WT) and transgenic SOD1-G93A mouse physiology dynamics were built using the first-principles-based first-order feedback framework of dynamic meta-analysis with parameter optimization. Two in silico models were developed: a WT mouse model to simulate normal homeostasis and a SOD1-G93A ALS model to simulate ALS pathology dynamics and their response to in silico treatments. The model simulates functional molecular mechanisms for apoptosis, metal chelation, energetics, excitotoxicity, inflammation, oxidative stress, and proteomics using curated data from published SOD1-G93A mouse experiments. Temporal disease progression measures (rotarod, grip strength, body weight) were used for validation. Results illustrate that untreated SOD1-G93A ALS dynamics cannot maintain homeostasis due to a mathematical oscillating instability as determined by eigenvalue analysis. The onset and magnitude of homeostatic instability corresponded to disease onset and progression. Oscillations were associated with high feedback gain due to hypervigilant regulation. Multiple combination treatments stabilized the SOD1-G93A ALS mouse dynamics to near-normal WT homeostasis. However, treatment timing and effect size were critical to stabilization corresponding to therapeutic success. The dynamics-based approach redefines therapeutic strategies by emphasizing the restoration of homeostasis through precisely timed and stabilizing combination therapies, presenting a promising framework for application to other multifactorial neurodegenerative diseases.
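The eigenvalue analysis mentioned above can be illustrated on a two-variable linear system dx/dt = A x. The sketch below is not the study's model; the matrices are invented, and the helper names `eigenvalues_2x2` and `classify` are hypothetical. It only shows the general rule: a positive real part in any eigenvalue means perturbations grow (instability), and a nonzero imaginary part means the response oscillates.

```python
import cmath


def eigenvalues_2x2(m):
    """Eigenvalues of a 2x2 matrix from its trace and determinant."""
    (a, b), (c, d) = m
    trace = a + d
    det = a * d - b * c
    root = cmath.sqrt(trace * trace - 4 * det)
    return (trace + root) / 2, (trace - root) / 2


def classify(m, tol=1e-12):
    """Label the linear system dx/dt = m x by growth and oscillation."""
    eigs = eigenvalues_2x2(m)
    # Real parts within `tol` of zero are treated as not growing.
    growing = max(e.real for e in eigs) > tol
    oscillating = any(abs(e.imag) > tol for e in eigs)
    return (
        "unstable" if growing else "stable",
        "oscillating" if oscillating else "non-oscillating",
    )


# Invented example matrices, not fitted to any mouse data.
damped = [[-1.0, -2.0], [2.0, -1.0]]   # eigenvalues -1 +/- 2i
runaway = [[0.5, -2.0], [2.0, 0.5]]    # eigenvalues 0.5 +/- 2i
print(classify(damped))
print(classify(runaway))
```

The first matrix describes oscillations that decay back to the set point; the second describes oscillations that grow, which is the kind of behavior the abstract calls an oscillating instability.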
Free, publicly-accessible full text available February 1, 2026
TrialSieve: A Comprehensive Biomedical Information Extraction Framework for PICO, Meta-Analysis, and Drug Repurposing

https://doi.org/10.3390/bioengineering12050486

Kartchner, David; Turner, Haydn; Ye, Christophe; Al-Hussaini, Irfan; Nursal, Batuhan; Lee, Albert J B; Deng, Jennifer; Curtis, Courtney; Cho, Hannah; Duvaris, Eva L; et al (May 2025, Bioengineering)

This work introduces TrialSieve, a novel framework for biomedical information extraction that enhances clinical meta-analysis and drug repurposing. By extending traditional PICO (Patient, Intervention, Comparison, Outcome) methodologies, TrialSieve incorporates hierarchical, treatment group-based graphs, enabling more comprehensive and quantitative comparisons of clinical outcomes. TrialSieve was used to annotate 1609 PubMed abstracts, 170,557 annotations, and 52,638 final spans, incorporating 20 unique annotation categories that capture a diverse range of biomedical entities relevant to systematic reviews and meta-analyses. The performance (accuracy, precision, recall, F1-score) of four natural-language processing (NLP) models (BioLinkBERT, BioBERT, KRISSBERT, PubMedBERT) and the large language model (LLM), GPT-4o, was evaluated using the human-annotated TrialSieve dataset. BioLinkBERT had the best accuracy (0.875) and recall (0.679) for biomedical entity labeling, whereas PubMedBERT had the best precision (0.614) and F1-score (0.639). Error analysis showed that NLP models trained on noisy, human-annotated data can match or, in most cases, surpass human performance. This finding highlights the feasibility of fully automating biomedical information extraction, even when relying on imperfectly annotated datasets. An annotator user study (n = 39) revealed significant (p < 0.05) gains in efficiency and human annotation accuracy with the unique TrialSieve tree-based annotation approach. In summary, TrialSieve provides a foundation to improve automated biomedical information extraction for frontend clinical research.
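The precision, recall, and F1-score figures above follow the standard definitions, which the sketch below computes for labeled spans. It assumes exact-match scoring of `(start, end, label)` tuples, which may differ from the evaluation protocol used in the study; the span offsets and label names are invented and are not TrialSieve's annotation categories.

```python
def precision_recall_f1(predicted, gold):
    """Exact-match precision, recall, and F1 over sets of labeled spans."""
    predicted, gold = set(predicted), set(gold)
    true_positives = len(predicted & gold)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    # F1 is the harmonic mean of precision and recall.
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Hypothetical spans: (start offset, end offset, label).
gold_spans = {(0, 9, "Drug"), (15, 22, "Outcome"), (30, 41, "Population")}
predicted_spans = {(0, 9, "Drug"), (15, 22, "Outcome"), (50, 55, "Drug"), (60, 70, "Dose")}
print(precision_recall_f1(predicted_spans, gold_spans))
```

Here two of four predictions are correct (precision 0.5) and two of three gold spans are found (recall 2/3), giving an F1 of 4/7.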
Free, publicly-accessible full text available May 1, 2026
Data Augmentation with Cross-Modal Variational Autoencoders (DACMVA) for Cancer Survival Prediction

https://doi.org/10.3390/info15010007

Rajaram, Sara; Mitchell, Cassie S. (January 2024, Information)

The ability to translate Generative Adversarial Networks (GANs) and Variational Autoencoders (VAEs) into different modalities and data types is essential to improve Deep Learning (DL) for predictive medicine. This work presents DACMVA, a novel framework to conduct data augmentation in a cross-modal dataset by translating between modalities and oversampling imputations of missing data. DACMVA was inspired by previous work on the alignment of latent spaces in Autoencoders. DACMVA is a DL data augmentation pipeline that improves the performance in a downstream prediction task. The unique DACMVA framework leverages a cross-modal loss to improve the imputation quality and employs training strategies to enable regularized latent spaces. Oversampling of augmented data is integrated into the prediction training. It is empirically demonstrated that the new DACMVA framework is effective in the often-neglected scenario of DL training on tabular data with continuous labels. Specifically, DACMVA is applied towards cancer survival prediction on tabular gene expression data where there is a portion of missing data in a given modality. DACMVA significantly (p << 0.001, one-sided Wilcoxon signed-rank test) outperformed the non-augmented baseline and competing augmentation methods with varying percentages of missing data (4%, 90%, 95% missing). As such, DACMVA provides significant performance improvements, even in very-low-data regimes, over existing state-of-the-art methods, including TDImpute and oversampling alone.
Full Text Available
Cross-Domain Text Mining of Pathophysiological Processes Associated with Diabetic Kidney Disease

https://doi.org/10.3390/ijms25084503

Patidar, Krutika; Deng, Jennifer H.; Mitchell, Cassie S.; Ford Versypt, Ashlee N (April 2024, International Journal of Molecular Sciences)

Full Text Available
What Threshold of Amyloid Reduction Is Necessary to Meaningfully Improve Cognitive Function in Transgenic Alzheimer’s Disease Mice?

https://doi.org/10.3233/ADR-230174

Singh, Anita; Maker, Matthew; Prakash, Jayant; Tandon, Raghav; Mitchell, Cassie S. (February 2024, Journal of Alzheimer's Disease Reports)

Background: Amyloid-β plaques (Aβ) are associated with Alzheimer’s disease (AD). Pooled assessment of amyloid reduction in transgenic AD mice is critical for expediting anti-amyloid AD therapeutic research. Objective: The mean threshold of Aβ reduction necessary to achieve cognitive improvement was measured via pooled assessment (n = 594 mice) of Morris water maze (MWM) escape latency of transgenic AD mice treated with substances intended to reduce Aβ via reduction of beta-secretase cleaving enzyme (BACE). Methods: Machine learning and statistical methods identified necessary amyloid reduction levels using mouse data (e.g., APP/PS1, LPS, Tg2576, 3xTg-AD, control, wild type, treated, untreated) curated from 22 published studies. Results: K-means clustering identified 4 clusters that primarily corresponded with level of Aβ: untreated transgenic AD control mice, wild type mice, and two clusters of transgenic AD mice treated with BACE inhibitors that had either an average 25% “medium reduction” of Aβ or 50% “high reduction” of Aβ compared to untreated control. A 25% Aβ reduction achieved a 28% cognitive improvement, and a 50% Aβ reduction resulted in a significant 32% improvement compared to untreated transgenic mice (p < 0.05). Comparatively, wild type mice had a mean 41% MWM latency improvement over untreated transgenic mice (p < 0.05). BACE reduction had a lesser impact on the ratio of Aβ42 to Aβ40. Supervised learning with an 80%–20% train-test split confirmed Aβ reduction was a key feature for predicting MWM escape latency (R² = 0.8 to 0.95). Conclusions: Results suggest a 25% reduction in Aβ as a meaningful treatment threshold for improving transgenic AD mouse cognition.
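To show how k-means can separate groups by Aβ level, the following is a minimal one-dimensional sketch with fixed starting centers. It is not the study's pipeline, which clustered curated multi-study data into four clusters; the values below are invented toy numbers (Aβ as a percentage of untreated control), chosen only so that three groups are visible.

```python
def kmeans_1d(values, centers, iterations=100):
    """Lloyd's algorithm on scalar data, starting from the given centers."""
    centers = [float(c) for c in centers]
    clusters = [[] for _ in centers]
    for _ in range(iterations):
        # Assignment step: attach each value to its nearest center.
        clusters = [[] for _ in centers]
        for v in values:
            nearest = min(range(len(centers)), key=lambda k: abs(v - centers[k]))
            clusters[nearest].append(v)
        # Update step: move each center to the mean of its cluster.
        updated = [
            sum(c) / len(c) if c else centers[k] for k, c in enumerate(clusters)
        ]
        if updated == centers:
            break  # converged: assignments can no longer change
        centers = updated
    return centers, clusters


# Invented toy values, not data from the study.
toy_abeta = [100, 98, 102, 76, 74, 75, 51, 49, 50]
final_centers, final_clusters = kmeans_1d(toy_abeta, centers=[90, 70, 40])
print(final_centers)
print(final_clusters)
```

With these toy values the centers settle at 100, 75, and 50, which mirrors the idea of untreated, medium-reduction, and high-reduction groups without reproducing any reported result.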
Full Text Available
SeizFt: Interpretable Machine Learning for Seizure Detection Using Wearables

https://doi.org/10.3390/bioengineering10080918

Al-Hussaini, Irfan; Mitchell, Cassie S. (August 2023, Bioengineering)

This work presents SeizFt—a novel seizure detection framework that utilizes machine learning to automatically detect seizures using wearable SensorDot EEG data. Inspired by interpretable sleep staging, our novel approach employs a unique combination of data augmentation, meaningful feature extraction, and an ensemble of decision trees to improve resilience to variations in EEG and to increase the capacity to generalize to unseen data. Fourier Transform (FT) Surrogates were utilized to increase sample size and improve the class balance between labeled non-seizure and seizure epochs. To enhance model stability and accuracy, SeizFt utilizes an ensemble of decision trees through the CatBoost classifier to classify each second of EEG recording as seizure or non-seizure. The SeizIt1 dataset was used for training, and the SeizIt2 dataset for validation and testing. Model performance for seizure detection was evaluated using two primary metrics: sensitivity using the any-overlap method (OVLP) and False Alarm (FA) rate using epoch-based scoring (EPOCH). Notably, SeizFt placed first among an array of state-of-the-art seizure detection algorithms as part of the Seizure Detection Grand Challenge at the 2023 International Conference on Acoustics, Speech, and Signal Processing (ICASSP). SeizFt outperformed state-of-the-art black-box models in accurate seizure detection and minimized false alarms, obtaining a total score of 40.15, combining OVLP and EPOCH across two tasks and representing an improvement of ~30% from the next best approach. The interpretability of SeizFt is a key advantage, as it fosters trust and accountability among healthcare professionals. The most predictive seizure detection features extracted from SeizFt were: delta wave, interquartile range, standard deviation, total absolute power, theta wave, the ratio of delta to theta, binned entropy, Hjorth complexity, delta + theta, and Higuchi fractal dimension. 
In conclusion, the successful application of SeizFt to wearable SensorDot data suggests its potential for real-time, continuous monitoring to improve personalized medicine for epilepsy.
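The Fourier Transform surrogates mentioned in this abstract are commonly built by phase randomization: keep the amplitude spectrum of a signal, draw new random phases, and transform back. The sketch below shows that generic construction with a plain discrete Fourier transform from the standard library. It is an assumption about the general technique, not SeizFt's implementation, and the test signal is synthetic rather than EEG.

```python
import cmath
import math
import random


def dft(x):
    """Naive discrete Fourier transform (O(n^2)), adequate for short signals."""
    n = len(x)
    return [
        sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]


def idft(spectrum):
    """Inverse of `dft`."""
    n = len(spectrum)
    return [
        sum(spectrum[k] * cmath.exp(2j * math.pi * k * t / n) for k in range(n)) / n
        for t in range(n)
    ]


def ft_surrogate(signal, rng):
    """Phase-randomized surrogate with the same amplitude spectrum as `signal`."""
    n = len(signal)
    spectrum = dft(signal)
    surrogate = list(spectrum)  # DC (and Nyquist, if n is even) stay unchanged
    for k in range(1, (n - 1) // 2 + 1):
        phase = rng.uniform(0.0, 2.0 * math.pi)
        surrogate[k] = abs(spectrum[k]) * cmath.exp(1j * phase)
        # Mirror with the complex conjugate so the inverse transform is real.
        surrogate[n - k] = surrogate[k].conjugate()
    return [value.real for value in idft(surrogate)]


# Synthetic 16-sample signal standing in for one epoch of a recording.
toy_signal = [
    math.sin(2 * math.pi * 3 * t / 16) + 0.5 * math.cos(2 * math.pi * 5 * t / 16) + 0.2
    for t in range(16)
]
toy_surrogate = ft_surrogate(toy_signal, random.Random(0))
print(toy_surrogate)
```

Each call with a different random generator state yields a new waveform with the same power at every frequency, which is why surrogates of this kind can be used to enlarge a minority class such as seizure epochs.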
Full Text Available
